Speech Recognition by Dereverberation Method Based on Multi-channel LMS Algorithm in Noisy Reverberant Environment

نویسندگان

  • Kyohei Odani
  • Longbiao Wang
چکیده

1 Introduction In a distant-talking environment, channel distortion drastically degrades speech recognition performance because of mismatches between the training and test environments. The current approaches focusing on robustness issues for automatic speech recognition (ASR) in noisy reverberant environments can be classified as speech enhancement, robust feature extraction, or model adaptation methods. In this paper, we focus on speech enhancement in a distant-talking environment. Previously, Wang et al. [1] proposed a robust distant-talking speech recognition method based on power spectral subtraction (SS) employing the adaptive multi-channel least mean squares (MCLMS) algorithm. In their study, late reverberation was treated as additive noise, and a noise reduction technique based on power SS was proposed to estimate the power spectrum of clean speech using an estimated power spectrum of the impulse response. To estimate the power spectra of the impulse responses, they extended the MCLMS algorithm for identifying impulse responses in a time domain [2] to a frequency domain. We proposed a blind dereverberation method based on generalized SS (GSS), which has been shown to be effective for noise reduction, instead of power SS [3]. The dereverberation method based on GSS with beamforming achieved a relative word error reduction rate of 9.8% and 31.4% compared to the dereverberation method based on power SS with beamforming and the conventional cepstral mean normalization (CMN) with beamforming, respectively. However, both the power SS-based method [1] and GSS-based method [3] were evaluated in a simulated reverberant environment without additive noise. In this paper, we evaluate the blind dereverberation methods in a real noisy reverberant environment with stationary noise or non-stationary noise. To suppress the stationary noise and non-stationary noise, we use a noise reduction technique based on GSS and a blind source separation based on independent component analysis (ICA), respectively. FastICA is one of the most popular algorithm for ICA. Our experimental result shows that combining the Efficient FastICA (EFICA) [4], which is an improved version of FastICA, is effective for our dereverberation method [6]. The schematic diagram of our dereverberation method is shown in Fig. 1. The late reverberation are reduced from the spectrum of multi-channel distorted speech by our dereverberation method using the estimated spectrum of impulse response. Thereafter, the early reverberation is normalized by CMN at the feature extraction stage. 2.1 Dereverberation based on GSS If speech s[t] is corrupted by convolutional noise h[t], the observed speech x[t] becomes x[t] = h[t] * s[t], where …

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-step linear prediction based speech dereverberation in noisy reverberant environment

A speech signal captured by a distant microphone is generally contaminated by reverberation and background noise, which severely degrade the automatic speech recognition (ASR) performance. In this paper, we first extend a previously proposed single channel dereverberation algorithm to a multi-channel scenario. The method estimates late reflections using multichannel multi-step linear prediction...

متن کامل

Title Placeholder

A speech signal captured by a distant microphone is generally contaminated by reverberation and background noise, which severely degrade the automatic speech recognition (ASR) performance. In this paper, we first extend a previously proposed single channel dereverberation algorithm to a multi-channel scenario. The method estimates late reflections using multichannel multi-step linear prediction...

متن کامل

Dereverberation and denoising based on generalized spectral subtraction by multi-channel LMS algorithm using a small-scale microphone array

A blind dereverberation method based on power spectral subtraction (SS) using a multi-channel least mean squares algorithm was previously proposed to suppress the reverberant speech without additive noise. The results of isolated word speech recognition experiments showed that this method achieved significant improvements over conventional cepstral mean normalization (CMN) in a reverberant envi...

متن کامل

Speech Recognition by Denoising and Dereverberation Based on Spectral Subtraction in a Real Noisy Reverberant Environment

A blind dereverberation method based on spectral subtraction using a multi-channel least mean squares algorithm was previously proposed. The results of a large vocabulary continuous speech recognition task showed that this method achieved significant improvements over the conventional method based on cepstral mean normalization and beamforming in a simulated reverberant environment without addi...

متن کامل

Distant-talking speaker identification by generalized spectral subtraction-based dereverberation and its efficient computation

Previously, a dereverberation method based on generalized spectral subtraction (GSS) using multi-channel least mean-squares (MCLMS) has been proposed. The results of speech recognition experiments showed that this method achieved a significant improvement over conventional methods. In this paper, we apply this method to distant-talking (far-field) speaker recognition. However, for far-field spe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013